The Conditional Lucas & Kanade Algorithm
The Lucas & Kanade (LK) algorithm is the method of choice for efficient dense
image and object alignment. The approach is efficient as it attempts to model
the connection between appearance and geometric displacement through a linear
relationship that assumes independence across pixel coordinates. A drawback of
the approach, however, is its generative nature. Specifically, its performance
is tightly coupled with how well the linear model can synthesize appearance
from geometric displacement, even though the alignment task itself is
associated with the inverse problem. In this paper, we present a new approach,
referred to as the Conditional LK algorithm, which: (i) directly learns linear
models that predict geometric displacement as a function of appearance, and
(ii) employs a novel strategy for ensuring that the generative pixel
independence assumption can still be taken advantage of. We demonstrate that
our approach exhibits superior performance to classical generative forms of the
LK algorithm. Furthermore, we demonstrate its comparable performance to
state-of-the-art methods such as the Supervised Descent Method with
substantially fewer training examples, as well as the unique ability to "swap"
geometric warp functions without having to retrain from scratch. Finally, from
a theoretical perspective, our approach hints at possible redundancies that
exist in current state-of-the-art methods for alignment that could be leveraged
in vision systems of the future.

Comment: 17 pages, 11 figures
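The generative-versus-conditional distinction in the abstract above can be illustrated in one dimension. The sketch below is our own minimal construction (translation-only warp, illustrative names, not the authors' implementation): a classical LK step linearizes the template to solve for displacement, while a "conditional"-style regressor is fit by least squares to predict displacement directly from appearance differences over synthetic perturbations.

```python
import numpy as np

def make_signal(n=200):
    x = np.arange(n)
    return np.exp(-0.5 * ((x - n / 2) / 10.0) ** 2)  # smooth 1D template

def shift(signal, dx):
    x = np.arange(len(signal))
    return np.interp(x - dx, x, signal)  # translate by dx (linear interp)

template = make_signal()
grad = np.gradient(template)  # appearance Jacobian of a 1D translation

def lk_step(image):
    # Classical (generative) LK: linearize T(x - dp) ~ T(x) - dp * T'(x)
    # and solve the scalar normal equation for the displacement dp.
    return -grad @ (image - template) / (grad @ grad)

# "Conditional" flavour: learn a linear map from appearance differences
# to displacement, using synthetically perturbed copies of the template.
train_dx = np.linspace(-3, 3, 25)
A = np.stack([shift(template, d) - template for d in train_dx])
w, *_ = np.linalg.lstsq(A, train_dx, rcond=None)

image = shift(template, 1.5)  # unknown displacement to recover
print(lk_step(image), w @ (image - template))
```

Both estimators recover the displacement here; the point of contrast is that the first inverts a generative appearance model, while the second learns the inverse mapping directly.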
CubeNet: Equivariance to 3D Rotation and Translation
3D Convolutional Neural Networks are sensitive to transformations applied to
their input. This is a problem because a voxelized version of a 3D object, and
its rotated clone, will look unrelated to each other after passing through to
the last layer of a network. Instead, an idealized model would preserve a
meaningful representation of the voxelized object, while explaining the
pose-difference between the two inputs. An equivariant representation vector
has two components: the invariant identity part, and a discernible encoding of
the transformation. Models that can't explain pose-differences risk "diluting"
the representation, in pursuit of optimizing a classification or regression
loss function.
We introduce a Group Convolutional Neural Network with linear equivariance to
translations and right angle rotations in three dimensions. We call this
network CubeNet, reflecting its cube-like symmetry. By construction, this
network helps preserve a 3D shape's global and local signature, as it is
transformed through successive layers. We apply this network to a variety of 3D
inference problems, achieving state-of-the-art on the ModelNet10 classification
challenge, and comparable performance on the ISBI 2012 Connectome Segmentation
Benchmark. To the best of our knowledge, this is the first 3D rotation
equivariant CNN for voxel representations.

Comment: Preprint
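The cube-group structure underlying the abstract above can be made concrete without any learning. The sketch below is our own illustration (not CubeNet's architecture): enumerating the 24 right-angle rotations of a voxel grid gives a group orbit; pooling over the orbit yields an invariant "identity" descriptor, while the index of the matching rotation encodes the pose, mirroring the two components of an equivariant representation described above.

```python
import numpy as np

def z_spins(v):
    for k in range(4):
        yield np.rot90(v, k, axes=(0, 1))

def rotations24(v):
    # 6 choices of "up" face x 4 spins about the vertical axis = cube group
    yield from z_spins(v)
    for k in (1, 2, 3):
        yield from z_spins(np.rot90(v, k, axes=(0, 2)))
    for k in (1, 3):
        yield from z_spins(np.rot90(v, k, axes=(1, 2)))

def invariant_id(v):
    # orbit pooling: a descriptor unchanged by any right-angle rotation of v
    return max(r.tobytes() for r in rotations24(v))

def pose(v, reference):
    # which group element maps reference onto v (the pose component)
    return next(k for k, r in enumerate(rotations24(reference))
                if np.array_equal(r, v))

rng = np.random.default_rng(0)
voxels = (rng.random((4, 4, 4)) > 0.5).astype(np.uint8)
rotated = np.rot90(voxels, 1, axes=(1, 2))  # rigidly rotated clone

print(invariant_id(voxels) == invariant_id(rotated))
print(pose(rotated, voxels))
```

A network layer built from such group-indexed copies of a filter is equivariant by construction; the invariance shown here falls out of pooling over the group dimension.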
Culture shapes how we look at faces
Background: Face processing, amongst many basic visual skills, is thought to be invariant across all humans. From as early as 1965, studies of eye movements have consistently revealed a systematic triangular sequence of fixations over the eyes and the mouth, suggesting that faces elicit a universal, biologically-determined information extraction pattern.

Methodology/Principal Findings: Here we monitored the eye movements of Western Caucasian and East Asian observers while they learned, recognized, and categorized by race Western Caucasian and East Asian faces. Western Caucasian observers reproduced a scattered triangular pattern of fixations for faces of both races and across tasks. Contrary to intuition, East Asian observers focused more on the central region of the face.

Conclusions/Significance: These results demonstrate that face processing can no longer be considered as arising from a universal series of perceptual events. The strategy employed to extract visual information from faces differs across cultures.
Saccadic facilitation by modulation of microsaccades in natural backgrounds
Saccades move objects of interest into the center of the visual field for high-acuity visual analysis. White, Stritzke, and Gegenfurtner (Current Biology, 18, 124–128, 2008) have shown that saccadic latencies in the context of a structured background are much shorter than those with an unstructured background at equal levels of visibility. This effect has been explained by possible preactivation of the saccadic circuitry whenever a structured background acts as a mask for potential saccade targets. Here, we show that background textures modulate rates of microsaccades during visual fixation. First, after a display change, structured backgrounds induce a stronger decrease of microsaccade rates than do uniform backgrounds. Second, we demonstrate that the occurrence of a microsaccade in a critical time window can delay a subsequent saccadic response. Taken together, our findings suggest that microsaccades contribute to the saccadic facilitation effect, due to a modulation of microsaccade rates by properties of the background
Optimal measurement of visual motion across spatial and temporal scales
Sensory systems use limited resources to mediate the perception of a great
variety of objects and events. Here a normative framework is presented for
exploring how the problem of efficient allocation of resources can be solved in
visual perception. Starting with a basic property of every measurement,
captured by Gabor's uncertainty relation about the location and frequency
content of signals, prescriptions are developed for optimal allocation of
sensors for reliable perception of visual motion. This study reveals that a
large-scale characteristic of human vision (the spatiotemporal contrast
sensitivity function) is similar to the optimal prescription, and it suggests
that some previously puzzling phenomena of visual sensitivity, adaptation, and
perceptual organization have simple principled explanations.

Comment: 28 pages, 10 figures, 2 appendices; in press in Favorskaya MN and
Jain LC (Eds), Computer Vision in Advanced Control Systems using Conventional
and Intelligent Paradigms, Intelligent Systems Reference Library,
Springer-Verlag, Berlin
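The "basic property of every measurement" referred to in the abstract above is Gabor's uncertainty relation. Stated here as general background (root-mean-square spreads, not necessarily the paper's notation), it bounds how precisely any single sensor can jointly resolve the location and frequency content of a signal:

```latex
\Delta t \,\Delta f \;\ge\; \frac{1}{4\pi}
```

Equality holds only for Gaussian-windowed sinusoids (Gabor functions), which is one reason such functions are widely used to model visual receptive fields.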
Adaptive Filtering Enhances Information Transmission in Visual Cortex
Sensory neuroscience seeks to understand how the brain encodes natural
environments. However, neural coding has largely been studied using simplified
stimuli. In order to assess whether the brain's coding strategy depends on the
stimulus ensemble, we apply a new information-theoretic method that allows
unbiased calculation of neural filters (receptive fields) from responses to
natural scenes or other complex signals with strong multipoint correlations. In
the cat primary visual cortex we compare responses to natural inputs with those
to noise inputs matched for luminance and contrast. We find that neural filters
adaptively change with the input ensemble so as to increase the information
carried by the neural response about the filtered stimulus. Adaptation affects
the spatial frequency composition of the filter, enhancing sensitivity to
under-represented frequencies in agreement with optimal encoding arguments.
Adaptation occurs over 40 s to many minutes, longer than most previously
reported forms of adaptation.

Comment: 20 pages, 11 figures, includes supplementary information
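Why natural inputs with strong correlations demand special estimators, as the abstract above notes, can be illustrated with a linear-nonlinear model neuron. The sketch below uses a standard whitened spike-triggered average (STA), not the paper's information-theoretic method, and all parameters are illustrative: a correlated stimulus biases the raw STA toward the stimulus covariance, and decorrelating by that covariance largely recovers the true filter.

```python
import numpy as np

rng = np.random.default_rng(1)
dim, n = 20, 300_000

# Correlated "natural-like" stimulus: each channel mixes two adjacent white
# channels, giving strong neighbour correlations (Toeplitz covariance).
white = rng.standard_normal((n, dim + 1))
stim = white[:, :dim] + 0.8 * white[:, 1:]

# Ground-truth high-frequency filter of a linear-nonlinear model neuron.
true_filter = np.zeros(dim)
true_filter[8:12] = [1.0, -2.0, 2.0, -1.0]
true_filter /= np.linalg.norm(true_filter)

rate = np.exp(stim @ true_filter - 1.0)  # exponential nonlinearity
spikes = rng.poisson(rate)               # Poisson spike counts per bin

sta = spikes @ stim / spikes.sum()       # raw STA: smeared by correlations
cov = np.cov(stim.T)
# Decorrelated STA; the small ridge term is only for numerical stability.
whitened = np.linalg.solve(cov + 0.02 * np.eye(dim), sta)

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

print(cosine(sta, true_filter), cosine(whitened, true_filter))
```

For this exponential-nonlinearity model the raw STA converges to the covariance-filtered kernel rather than the kernel itself, which is exactly the kind of bias that motivates estimators designed for stimuli with strong multipoint correlations.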
Longer fixation duration while viewing face images
The spatio-temporal properties of saccadic eye movements can be influenced by the cognitive demand and the characteristics of the observed scene. Probably due to its crucial role in social communication, it is argued that face perception may involve different cognitive processes compared with non-face object or scene perception. In this study, we investigated whether and how face and natural scene images can influence the patterns of visuomotor activity. We recorded monkeys’ saccadic eye movements as they freely viewed monkey face and natural scene images. The face and natural scene images attracted a similar number of fixations, but viewing of faces was accompanied by longer fixations compared with natural scenes. These longer fixations were dependent on the context of facial features. The duration of fixations directed at facial contours decreased when the face images were scrambled, and increased at the later stage of normal face viewing. The results suggest that face and natural scene images can generate different patterns of visuomotor activity. The extra fixation duration on faces may be correlated with the detailed analysis of facial features.
Local biases drive, but do not determine, the perception of illusory trajectories
When a dot moves horizontally across a set of tilted lines of alternating orientations, the dot appears to be moving up and down along its trajectory. This perceptual phenomenon, known as the slalom illusion, reveals a mismatch between the veridical motion signals and the subjective percept of the motion trajectory, which has not been comprehensively explained. In the present study, we investigated the empirical boundaries of the slalom illusion using psychophysical methods. The phenomenon was found to occur both under conditions of smooth pursuit eye movements and constant fixation, and to be consistently amplified by intermittently occluding the dot trajectory. When the motion direction of the dot was not constant, however, the stimulus display did not elicit the expected illusory percept. These findings confirm that a local bias towards perpendicularity at the intersection points between the dot trajectory and the tilted lines causes the illusion, but also highlight that higher-level cortical processes are involved in interpreting and amplifying the biased local motion signals into a global illusion of trajectory perception.